External reproducibility of intentional harm evaluation (Score: 0)

Are the evaluations of the model’s risks related to intentional harm reproducible by external entities?

Note: For an evaluation to be reproducible by an external entity, we mean that the associated data is either (i) publicly available or (ii) described sufficiently such that a reasonable facsimile can be constructed by the external entity. In addition, the evaluation protocol should be sufficiently described such that if the evaluation is reproduced, any discrepancies with the developer's results can be resolved. We recognize that there does not exist an authoritative or consensus standard for what is required for an evaluation to be deemed externally reproducible. Evaluations on standard benchmarks are assumed to be sufficiently reproducible for the purposes of this index. We will award this point for reproducibility of multiple disclosed evaluations. In the event that an evaluation is not reproducible, a justification by the model developer for why it is not possible for the evaluation to be made reproducible may suffice.

References: https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt

Justification: Not disclosed

New disclosure? Yes

Do users receive a justification when they are subject to an enforcement action for violating the usage policy?

Note: For example, does the developer disclose a protocol for telling users which part of the usage policy they violated, when they did so, and what specifically was violative? Enforcement actions refer to measures to limit a user’s ability to use the model, such as banning a user or restricting their ability to purchase tokens. We will award this point if the developer discloses that it gives justification for enforcement actions or, alternatively, if it discloses that it does not provide justification for enforcement actions or that it does not enforce its usage policy.

References: https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt

Justification: No clear disclosure of whether a justification is given following an enforcement action (e.g. a user being banned).

New disclosure? Yes

Usage policy violation appeals mechanism (Score: 0)

Is a mechanism for appealing potential usage policy violations disclosed?

Disclosure: Not disclosed

Note: We will award this point if the developer provides a usage policy violation appeals mechanism, regardless of whether it is provided via a user interface or distribution channel.

References: Not disclosed

Justification: Not disclosed

New disclosure? No

Permitted, restricted, and prohibited model behaviors (Score: 1)

Are model behaviors that are permitted, restricted, and prohibited disclosed?

Disclosure: Beyond the Acceptable Use Policy and other mitigations and conditions described here, the model is not subject to additional model behavior interventions of the type described in the Foundation Model Transparency Index.

Note: We refer to a policy that includes this information as a model behavior policy, or a developer's policy on what the foundation model can and cannot do (e.g. such a policy may prohibit a model from generating child sexual abuse material). We recognize that different developers may adopt different business models and that some business models may make enforcement of a model behavior policy more or less feasible. We will award this point if at least two of the three categories (i.e. permitted, restricted, and prohibited model behaviors) are disclosed. Alternatively, we will award this point if the developer reports that it does not impose any restrictions on its model's behavior.

References: https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt

Justification: Not disclosed

New disclosure? Yes

Model behavior policy enforcement (Score: 1)

Is the enforcement protocol for the model behavior policy disclosed?

Note: By enforcement protocol, we refer to mechanisms for identifying whether model behavior is permitted or prohibited and actions that may arise in the event the model behavior policy is violated. For example, the developer may make updates to the model in response to issues with the model’s adherence to the model behavior policy. We will award this point if there is a clear description of the enforcement protocol, or if the developer reports that it does not enforce its model behavior policy or that it has no such restrictions on the model’s behavior.

References: https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt

Justification: Not disclosed

New disclosure? Yes

Across all downstream applications, is the fraction of applications corresponding to each market sector disclosed?

Note: By market sector, we refer to an identifiable part of the economy. While established standards exist for describing market sectors, we recognize that developers may provide vague or informal characterizations of market impact. We will award this point if there is a meaningful, though potentially vague or incomplete, summary of affected market sectors.

References: Not disclosed

Justification: Not disclosed

New disclosure? Yes

Affected individuals (Score: 0)

Across all forms of downstream use, is the number of individuals affected by the foundation model disclosed?

Note: By affected individuals, we principally mean the number of potential users of applications. We recognize that there does not exist an authoritative or consensus standard for what qualifies as an affected individual. We will award this point if there is a meaningful estimate of the number of affected individuals along with a clear description of what it means for an individual to be affected by the model.

References: Not disclosed

Justification: Not disclosed

New disclosure? Yes

Usage reports (Score: 0)

Is a usage report that gives usage statistics describing the impact of the model on users disclosed?

Note: We recognize that there does not exist an authoritative or consensus standard for what is required in a usage report. Usage statistics might include, for example, a description of the major categories of harm that have been caused by use of the model. We will award this point if there is a meaningful, though potentially vague or incomplete, summary of usage statistics.

References: Not disclosed

Justification: Not disclosed

New disclosure? Yes

Geographic statistics (Score: 0)

Across all forms of downstream use, are statistics of model usage across geographies disclosed?

Note: We will award this point if there is a meaningful, though potentially incomplete or vague, disclosure of geographic usage statistics at the country level.

References: Not disclosed

Justification: Not disclosed

New disclosure? Yes

Redress mechanism (Score: 0)

Is any mechanism to provide redress to users for harm disclosed?

Disclosure: Not disclosed

Note: We will also award this point if the developer reports it does not have any such redress mechanism.

References: Not disclosed

Justification: Not disclosed

New disclosure? No

Centralized documentation for downstream use (Score: 1)

Is documentation for downstream use consolidated in a centralized artifact?

Disclosure: Information about the model, its development process, and its usage protocols can be found in the GitHub repo, the associated research paper, and the Hugging Face model page and cards.

Note: Centralized documentation for downstream use refers to an artifact, or closely linked artifacts, that consolidate relevant information for making use of or repurposing the model. Examples of these kinds of artifacts include a website with dedicated documentation, a GitHub repository with dedicated documentation, and an ecosystem card. We recognize that different developers may take different approaches to centralizing information. We will award this point if there are one or more clearly identified artifacts that contain the majority of substantive information (e.g. capabilities, limitations, risks, evaluations, distribution channels, model license, usage policies, model behavior policies, feedback and redress mechanisms, dependencies).

References: https://github.com/Stability-AI/generative-models/tree/main

Justification: The GitHub repository satisfies this indicator.

New disclosure? No

Documentation for responsible downstream use (Score: 1)

Is documentation for responsible downstream use disclosed?

Disclosure: Information about the model, its development process, and its usage protocols can be found in the GitHub repo, the associated research paper, and the Hugging Face model page and cards. The released model inference and demo code has image-level watermarking enabled by default, which can be used to detect model outputs. This is done via the imWatermark Python library.

Note: Such documentation might include details on how to adjust API settings to promote responsible use, descriptions of how to implement mitigations, or guidelines for responsible use. We will also award this point if the developer states that it does not provide any such documentation. For example, the developer might state that the model is offered as is and downstream developers are accountable for using the model responsibly.

References: https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt

Justification: See the "in-house NSFW filters" that Stability refers to in its model card; its other documentation refers to LAION's NSFW filters: https://github.com/LAION-AI/CLIP-based-NSFW-Detector

New disclosure? No
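
Illustration: The disclosure above mentions image-level watermarking that is enabled by default and can later be used to detect outputs. The sketch below shows only the general embed-then-detect idea, using a toy least-significant-bit scheme on a flat list of 8-bit pixel values. It is an assumption-laden stand-in and not the algorithm or API of the imWatermark library, which embeds its mark in a frequency-domain transform of the image. The names WATERMARK_BITS, embed, extract, and is_watermarked are hypothetical.

```python
"""Toy LSB watermark: an illustration only, NOT the imWatermark method."""

# Hypothetical 8-bit tag; a real deployment would choose its own payload.
WATERMARK_BITS = [1, 0, 1, 1, 0, 0, 1, 0]


def embed(pixels, bits):
    """Return a copy of `pixels` with `bits` stored in the lowest bit
    of the first len(bits) pixel values (each assumed to be 0..255)."""
    if len(bits) > len(pixels):
        raise ValueError("image too small to hold the watermark")
    marked = list(pixels)
    for i, bit in enumerate(bits):
        # Clear the lowest bit, then set it to the watermark bit.
        marked[i] = (marked[i] & ~1) | bit
    return marked


def extract(pixels, n_bits):
    """Read back the lowest bit of the first `n_bits` pixel values."""
    return [p & 1 for p in pixels[:n_bits]]


def is_watermarked(pixels, bits):
    """Detect the tag by comparing the extracted bits with the expected ones."""
    return extract(pixels, len(bits)) == list(bits)


if __name__ == "__main__":
    frame = [200, 13, 77, 54, 255, 0, 128, 91, 42, 42]
    marked = embed(frame, WATERMARK_BITS)
    print(is_watermarked(frame, WATERMARK_BITS))
    print(is_watermarked(marked, WATERMARK_BITS))
```

Each marked pixel differs from the original by at most 1, which is why such a mark is invisible to a viewer; unlike a frequency-domain mark, however, this toy version would not survive compression or resizing.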